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NEW PROTEIN FROM URINE NAMED COMPONENT B 



The present invention relates to a new protein named Component B. 
In particular the invention relates to a new protein obtainable from 
urine, its preparation from urine, its production by recombinant DNA 
techniques using genomic DNA or cDNA encoding said new protein, as 
5 well as pharmaceutical compositions containing it and its use in 
therapy. 

A new protein was isolated during the extraction and purification 
process of urine derivatives, this protein shows a polypeptide 
nature and relatively low molecular weight. When humane urine is 

10 treated with adsorbing materials, as kaolin, and then undergoes 
filtration, ion exchange chromatography and high resolution 
chromatography, preferably according to the process hereafter 
described, after lyophilisation a compound is obtained as amorphous 
white powder, moving as a single peak in high pressure reversed 

15 phase liquid chromatography (HPLC-RP) and having a molecular weight 
of about 9KDa when analysed by electrophoresis on a polyacrylamide 
gel in the presence of sodium dodecyl sulphate (SDS-Page) under 
reducing conditions. This protein was named, and is referred to 
hereinafter. Component B. 

20 Component B is more specifically characterised through the amino 
acid sequence reported as SEQ ID NO: 1. 

The present invention makes therefore available a new protein, named 
Component B. obtainable through a process comprising the isolation 
of a raw fraction of the compound itself from a dialysed concentrate 
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of urine after treatment with an adsorbing agent and its 
purification by ion exchange chromatography and high resolution 
chromatography as described hereafter. 

Preferably the protein according to the present invention is 
5 extracted from human urine because of its high amount available 
useful for industrial production. The present invention refers 
particularly to a polypeptide comprising the SEQ ID NO: 1, its 
salts, functional derivatives, precursors and active fractions as 
well as its active mutants, i.e. other proteins or polypeptides 
10 wherein one or more amino acids of the structure were eliminated or 
substituted by other afflino acids or one or more amino acids were 
added to that sequence in order to obtain polypeptides or proteins 
having the same activity of Component B and comprises also the 
corresponding fusion proteins i.e. polypeptides comprising Component 
15 B or a mutation thereof fused with another protein and having a 
longer lasting half-life in body fluids. Component B can therefore 
be fused with another protein such as. for example. an 
immunoglobulin. 

The definition "salts" as used herein refers both to salts of the 
20 carboxyl-groups and to the salts of the amino functions of the 
compound obtainable through known methods. 

The salts of the carboxyl-groups comprise inorganic salts as. for 
example, sodium, potassium, calcium salts and salts with organic 
bases as those formed with an amine as triethanolamine , arginine or 
25 lisine. The salts of the amino groups comprise for example salts 
with inorganic acids as hydrochloric acid and with organic acids as 
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acetic acid. The definition "functional derivatives" as herein 

used refers to derivatives which can be prepared from the functional 
groups present on the lateral chains of the amino acid moieties or 
on the terminal N- or C- groups according to known meithods and are 
5 comprised in the invention when they are pharmacetically acceptable 
i.e. when they do not destroy the protein activity or do not impart 
toxicity to the pharmaceutical compositions containing: them. 
Such derivatives include for example esters or aliphatic amides of 
the carboxyl-groups and N-acyl derivatives of free amino groups or 
10 0-acyl derivatives of free hydroxyl-groups and are formed with acyl- 
groups as for example alcanoyl- or aroyl-groups . 

The "precursors" are compounds which are converted into the 
Component B in the human or animal body. As "active fractions" of 
the protein the present invention refers to any fragment or 

15 precursor of the polypeptidic chain of the compound itself, alone or 
in combination with related molecules or residues bound to it, for 
example residues of sugars or phosphates, or aggregates of the 
polypeptide molecule when such fragments or precursors show the same 
activity of Component B as medicament. 

20 The present invention refers also to a mixture of polypeptides and 
derivatives as said above, 

A second aspect of the present invention concerns the process of 
preparation of Component B, such process comprising the isolation of 
a raw fraction of the protein from a dialysed concentrate of urine 
25 after treatment with an adsorbing agent and its purification through 
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ion exchange chromatography and high resolution chromatography. 
Preferably, Component B is prepared through the process illustrated 
in Figure 1 and comprising the following steps: 

a) adsorption of urine at acid pH on kaolin and extraction with 
ammonia 

b) elution of fraction (a) on Bio Rex 70 resin with ammonia 

c) elution of fraction (b) on DEAE Sepharose resin with acetate 
buffer 

d) elution of fraction (c) on CM Sepharose resin with acetate buffer 

e) elution of fraction (d) on HPLC Cl8 resin with a mixture of 
acetate buffer and acetonitrile 

f) elution of fraction (e) on DE-52 resin with acetate buffer 

g) elution of fraction (f) on D-Zhephyr resin with acetate buffer 

h) elution of fraction (g) on HPLC Cl8 resin with a mixture of 
15 aqueous trif luoroacetic acid and acetonitrile 

i) elution of fraction (h) on D- Zephyr resin with acetate buffer. 
The present invention refers also to recombinant DNA molecules which 
comprise the nucleotidic sequence encoding the polypeptide according 
to the invention, its active mutants or fusion proteins, expression 

20 vectors which comprise it. host-cells transformed with such vectors 
and a process of preparation of such polypeptide, its active mutants 
or fusion proteins, through the culture in appropriate culture 
media of said transformed cells. The definition "recombinant DNA 
molecules" include genomic DNA. cDNA. synthetic DNA and combinations 

25 thereof. In particular the present invention refers to the 
nucleotide sequences illustrated in SEQ ID NO: 2 and SEQ ID NO: 3 
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respectively. 

SEQ ID NO: 2: reports the genomic DNA sequence encoding Component B; 
Figure 2 reports the restriction map of Component B transcriptional 
unit; 

SEQ ID NO: 3: reports the cDNA sequence encoding Component B; 
Figure 8 shows the complete Component B cDNA sequence, in which the 
restriction sites are indicated. The cloning of Component B can be 
performed through different techniques. According to one of these 
techniques an oligonucleotide, or a mixture of oligonucleotides, are 
prepared, their sequence being derived from the sequence of 
Component B or its fragment and used as probe for cloning the cDNA 
or the genomic DNA encoding Component B. 

SEQ ID NO: ^: reports the amino acids sequence encoded both by the 
genomic DNA reported in SEQ ID NO: 2 and by the cDNA reported in SEQ 
15 ID NO: 3. 

The present invention also refers to recombinant DNA molecules which 
hybridize with the DNA sequence coding for Component B or fragments 
thereof. 

The gene can contain, or not, the natural introhs and can be 
obtained for example by extraetion from appropriate cells and 
purification with known methods. Appropriate preparations of DNA, as 
human genomic DNA. are cut in the appropriate way. preferably with 
restriction enzymes, and the so obtained fragments are introduced in 
appropriate recombinant vectors in order to form a DNA library. Such 
vectors can be selected with synthetic oligonucleotide probes in 



20 



25 
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order to identify a sequence encoding Con>ponent B according to the 

invention. 

In particular, according to the present invention, the genomic DNA 
of Component B was isolated and cloned. 
5 On the other hand, the corresponding mRNA can be isolated from the 
cells expressing Component^B and used to produce the complementary 
DNA (cDNA) with known methods. This cDNA after having been converted 
in the double helix, can be introduced in an apropriated vector 
which can afterwards be used for transforming an appropriated host 
10 cell. The resulting cultures are then selected with an appropriate 
probe in order to obtain the cDNA encoding the targeted sequences, 
once the wanted clone is isolated, the cDNA can be manipulated 
essentially in the same way as the genomic DNA. 
The cDNA does not contain introns. 
15 Because of the degeneration of the genetic code, various codons can 
be used for encoding a specific amino acid, so that one or more 
oligonucleotides can be produced, each of them being able to encode 
fragments of Component B. However only one member of this pool 
possesses the nucleotide sequence identical to that of the gene. Its 
20 presence in the pool and its capacity of hybridizing with the DNA 
also in the presence of other members of the pool makes it possible 
the use of the group of non fractioned oligonucleotides in the same 
way as a single oligonucleotide could be used for cloning the gene 
encoding the targeted peptide. Alternatively. a single 
25 oligonucleotide containing the sequence which is theoretically the 
most probable being able.; of encoding the genie fragments of 
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Component B (according to what described in the "rules for the use 
of codons" in Lathe R, at al. J .Molec.Biol, l83:l-12 (1985)) allows 
the identification the complementary DNA encoding Component B or a 
fragment thereof. 

5 The processes for hybridizing the nucleic acids are known and 
described, for example in Maniatis T. et al. Molecular Cloning: A 
laboratory manual. Cold Spring Harbor Press, Cold Spring Harbor, NY, 
1982) and in Haymes B.T. et al . Nucleic Acid Hybridization: A 
practical approach, IRL Press, Oxford, England, (I985) . Through the 

10 hybridization using said probe or group of nucleotide probes it is 
possible to identify in a genomic or cDNA gene library the DNA 
sequences capable of such hybridization which are thereafter 
analysed to confirm that they encode the polypeptide according to 
the invention (i.e. Component B) . The oligonucleotide which 

15 contains such complementary sequence can be synthetized and used as 
probe to identify and isolate the gene of the polypeptide according 
to the invention i.e. Component B (Maniatis T, et al. ibid.). 
Once the appropriate oligonucleotide specific for Component B is 
selected using the above said method, it is possible to synthetize 

20 and hybridize it with a DNA, or preferably with a cDNA derived from 
cells capable of expressing the wanted gene preferably after the 
source of cDNA was enriched of wanted sequences, for example by 
extraction of the RNA from cells producing high levels of the wanted 
gene and conversion of the RNA into the corresponding cDNA using the 

25 enzyme reverse transcriptase. 
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Alternatively, the suitable oligonucleotides specific for Component 
B can be synthesised and used as primers for the amplification of 
Component B cDNA fragments^ by RACE-PCR (M..A. Innis et al.. PGR 
Protocols, A Guide to Methods and Applications, Academic Press, 
5 1990). 

In particular, according to the present invention, a screening of 
different human and cellular tissues was performed firstly in order 
to identify the best available source for mRNA of Component B. Human 
tissues from brain, kidney, liver, lung, heart, pancreas, placenta, 

10 spleen, testis, thymus and uterus as well as epitheliod carcinoma, 
promyelocytic leukemia, breast adenocarcinoma, Burkitt's lymphoma 
and myeloma cell lines were screened for this purpose. 
The screening was performed by using a sensible assay "reverse 
transcriptase - polymerase chain reaction" (RT - PGR) . 

15 The human uterine tissue resulted the best source of mRNA. 

The cDNA clones of Component B were obtained by said tissue using 
the amplification method named "3' and 5' rapid Amplification of 
cDNA Ends" (RACE) . 

The DNA molecules encoding Component B, obtained with the above said 
20 method, were introduced in expression vectors constructed with known 
techniques (Maniatis T et al. ibid.). The double helix cDNA is 
ligated to plasmid vectors using, for example, techniques comprising 
the use of synthetic DNA adapters or techniques of binding "blunt- 
ended". 

25 For the expression of a targeted protein, an expression vector 
should comprise also specific nucleotide sequences containing the 
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information regulating transcription and translation bound to the 
DNA encoding the desired protein in such a way that the expression 
of the gene and the production of the protein are permitted. First 
of all , in order that the gene can be transcribed , i t mus t be 
5 preceded by a promoter which can be recognised by the RNA polymerase 
and to which the polymerase binds, thus starting the transcription 
process . 

Many promoters are known operating with different efficency (strong 
and weak promoters) which are different if used in prokaryotic or 
10 eukaryotic cells. 

The promoters which can be used in the present invention can be 
constitutive, as for example promoter int of lambda bacteriophage, 
promoter Bla of the gene of p-lactamase of pBR322 and the promoter 
CAT of the gene of chloramphenicol acetyl transferase of pPR325 ecc, , 
15 or inducible as for example the promoters of prokaryotes as the main 
right and left promoters of lambda bacteriophage (PI and Pr) , the 
promoters trp. rec A , lac Z . lac I , ompF and gal of E^coli, or the 
hybrid promoter trp*lac . etc. (Glik B.R. J. Ind, Microbiol. 1:277- 
282 (1987). 

10 Together with the strong promoters which are capable of producing 
huge quantities of mRNA, ^\y±r\^ high levels of gene expression in 
prokaryotic cells, it is necessary to use also binding sites for the 
ribosomes in order to assure that the mRNA is efficently translated. 
An example is given by the Shine-Dalgarno (SD) sequence positioned 

5 in appropriate way from the starting codon. 
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For eukaryotic host cells different sequences regulating the 
transcription and translation can be used according to the nature of 
the host. 

These can be derived from viral sources, as adenovirus, or papilloma 
virus. Simian virus or similar, wherein the regulation signals are 
associated to a specific gene having an high level of expression. 
Possible examples are the promoter TK of the herpes virus, the 
promoter of SVAO. the promoter of gene galU of yeast ecc. The 
Signals regulating the starting of transcription can be suitably 
chosen in order to produce repression or activation in such a way 
that the expression of the genes can be accordingly modulated. 
The DNA molecule comprising the nucleotide sequence encoding 
component B of the invention together with the signals regulating 
transcription and translation are introduced in a vector which is 
capable of integrating the sequences of the targeted gene in the 
host cell chromosome. 

The cells which bear the introduced DNA in their chromosome can be 
selected also introducing one or more markers which make it possible 
CO select the host cells containing the expression vector. The 
marker can provide the cells, for example, with antibiotics 
resistance or heavy metal (as copper) resistance. The selection gene 
can be directly bound to the DNA sequences which must be expressed 
or can be introduced in the cell itself by co trans fection . Other 
elements may also be necessary for a higher gene expression. These 
elements can comprise for example transcription enhancers and 
termination signals and introns. Expression vectors which include 
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such elements comprise those described by Okayama H. Mol. Cell. 
Biol, 3: 280 (1983). 

Among the factors to be considered for chosing a particular plasmid 
or viral vector are: the facility of detection of the cells 
5 containing the vector which can be easily separated from those which 
do not contain it; the number of copies of vectors which are wanted 
in a specific host, and the possibility, or not, of transferring the 
vector among different host cells. 

The preferred prokaryotic vectors comprise plasmids as those capable 
10 of replication in E.coli , as pBR322, ColEl , pSClOl , pACYC 184 etc. 
(Maniatis T. et al, ibid,). Bacillus plasmids as pC194, pC221, pT127 
etc. (Gryczan T.M. The Molecular Biology of the Bacilli, Academic 
press. Ny, 307-329 (1982) Streptomyces plasmids as pIJlOl (Kendall 
K.J, et al, J. bacterid. I69:4l77 - 4l83) and Pseudomonas plasmids 
15 (John J,F. et al. Rev. Infect. Dis. 8: 693-704 (1986) (Izaki K, Jpn. 
J. Bacteriol. 33: 729 - 742). 

The preferred eukaryotic vectors comprise, for example, BPV, SV40. 

Baculovirus etc, or their derivatives. Such vectors are known in the 

art (Bostein D. et al, Miami Wint Symp. 19: 265 - 274) (Broach J.R. 
20 The Molecular Biology of the Yeast Saccharomyces : Life Cycle and 

Inheritance . Cold Spring Harbor , NY , 455-470 ( I98I ) ( Broach J.R. 

cell 28:203-204 (1982) (Bollon D.P. et al. J. Clin. Hematol. Oncol. 

10: 39-48 (I98O) (Maniatis T. Cell Biology: A Comprehensive Treatise 

Vol. 3: Gene Expression Acad. Press NY 563 - 6O8 (I98O). 
25 The expression vector so prepared is introduced in the appropriate 
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host cell with an appropriate method such as transformation, 
transfection. lipofection. conjugation, protoplastic fusion, 
electrophoration. precipitation with calcium phosphate, direct 
microinjection etc. The host cell which can be used for the present 
invention can be prokaryotic or eukaryotic cells. 

Preferred prokaryotes include bacteria as E^Coli. Bacillus, 
streptomyces. pseudomonas. Salmonella. Serratia. etc. 
Particularly preferred is E.coli. as for example strain 29^ of 
E.coli K12 (AtCC or E.Coli X1776 (ATCC 31537). EXoli W 3110 

(F. laiDbda. ATCC 27325)- 

Preferred eukariotic host cells are mammalian cells as human, 
monkey, mouse or hamster (Chinese Hamster Ovary. CHO) cells since 
they assure to the protein molecules post- translation modifications, 
as for example the correct folding and glycosylation in the right 

positions . 

Yeast cells can be also used for the present invention. There are 
various recombinant DNA techniques which utilize sequences of strong 
promoters and a high number of copies of the plasmid and allow the 
production of the wanted protein in yeast. 

After the introduction of the vector in the host cells these are 
cultivated in a medium which allows the selective growth of cells 
containing the vector. 

The expression of the cloned DNA sequence allows the production of 
Component B. of a mutant or fragment thereof. The so expressed 
protein is isolated or purified through conventional techniques 
25 comprising extraction. ; precipitation. chromatography, 
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electrophoresis, or similar techniques, or affinity chromatography, 
using anti-Component B antibodies immobilised on the column gel. 
Component B can also be produced as milk-secreted protein in 
transgenic animals, 

A further aspect of the present invention is the use of Component B, 
its salts, functional derivatives, precursors or active fractions as 
medicament. 

In particular Component B has shown anti-inflammatory, anti- 
coagulant and anti-tumoral properties. Furthermore Component B can 
be useful in the therapy of pathologies correlated with altered 
levels of TGF-alpha, such as behavioural and hormonal disturbances, 
angiogenesys , etc.. 

In fact. Component B has been shown to inhibit the binding of TGF- 
alpha to its receptor with an affinity constant is = 0.77 * 10'^^ 
M measured by displacement of I^^^ - TGF-alpha from its receptor, 
obtained from A ^31 cell membranes. 

The pharmaceutical compositions containing a therapeutically active 
quantity of Component B in combination with pharmaceutically 
acceptable excipients or eluents are also an object of the present 
20 invention. Such compositions can be formulated for oral, rectal, 
nasal and particularly parenteral administration. 

Also the topic use of Component B is included in the present 
invention. 

The formulations according to the invention include also retard 
25 forms as subcutaneous implantations based on liposomes or 
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microcapsules of copolymers of lactic and glycolic acids. 

Other aspects of the invention will be evident in the light of the 

following detailed description. 

Example 1 - Process of preparation of Component B from human urine 
5 The preparation and purification of Component B from human urine is 
summarized in Fig, 1. ^ 

a) STEP 1 

The starting material is human urine to which HCl is added up to pH 
3.0. After decantation of the precipitate, kaolin is added to the 
10 urine (10 g/1 of starting urine). 

The suspension is left for l6 hours and is thereafter centrifugated . 
The supernatant is eliminated and kaolin is extracted with ammonia 
2M pH 11.0. 

The ammonia eluate pH is brought to 8.0 and is concentrated by 
15 membrane ultrafiltration (cut off 1000 Daltons). The whole operation 
is performed at 4°C. 

b) STEP 2 

The solution of point (a) is added with acetic acid up to pH U.O and 
then with Big Rex 70 resin previously equilibrated at pH 4.0 with 
20 acetic buffer. 

The solution is left under stirring for 4 hours and is then filtered 
on pressfilter. 

The adsorbed material is eluted from the Bio Rex 70 resin through 
elution with ammonia at pH 9-0. 
25 The chromatography eluate is concentrated by membrane 
ultrafiltration (cut off 1000 Daltons). 
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The whole operation is performed at Vc. 

c) STEP 3 

The material of point (b) , equilibrated in acetic buffer pH 5,6 is 
adsorbed on ion exchange resin like DEAE Sepharose, previously 
equilibrated at pH 5* 6. 
5 At the end of the adsorption elution is performed using ammonium 
acetate buffer 0.5 M at pH 5*6. The chromatography eluate is 
concentrated by membrane ultracentrifugation (cut off 1000 Daltons). 
The whole operation is performed at ^"C. 

d) STEP 4 

10 The material of point (c) is equilibrated with acetate buffer at pH 
4,5 and adsorbed on ion exchange resin like CM Sepharose previously 
equilibrated at pH 4.5- 

When the adsorption is completed eluation is performed with ammonium 
acetate buffer 0.15 M pH 4.5- The chromatography eluate is 
15 concentrated by membrane ultrafiltration (cut off 1000 daltons). The 
whole operation is performed at U*C. 

e) STEP 5 

The material of step (d) is purified at 25°C by reverse phase 
chromatography on HPLC Cl8 resin equilibrated in ammonium acetate 
20 buffer 0.05 M pH 5-6. 

The adsorbed material is eluted from the resin with an ammonium 
acetate solution containing acetonitrile 30% (v/v) . 

The chromatography eluate is concentrate by distillation (40*C) 
under vacuum. 
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f) STEP 6 

The material of step (e) is purified on ion exchange resin like DE- 
52, equilibrated at pH 5,6 in ammonium acetate buffer 0,02 M. 
The elution of the adsorbed material is performed with buffer 0.25 
5 M. The concentration is performed by membrane ultrafiltration (cut 
off 1000 Daltons). 

The whole operation is performed at U*C. 

g) STEP 7 

The material of step (f) is purified on ion. exchange resin like D- 
'^^ Zephyr prepacked column (sold by Sepracor) , equilibrated at pH 6.2 
in 20 mM sodium acetate buffer solution (buffer A), 

The elution of the absorbed material is performed by gradient 
elution from 100% buffer A to 100% 20mM sodium acetate buffer 
solution at pH 6.2 containing IM NaCl. 
1^ h) STEP 8 

The material of step (g) is purified by reversed phase 
chromatography at 25 "C on resin CI8 like HPLC, 

After adsorbtion the elution is perfomed with linear gradient formed 
by a binary mixture of aqueous solution of trif luoroacetic acid (TFA 
20 0,1%) and acetonitrile, acidified with TFA (0,1%). 

The chromatography eluate concentrated by distillation (^5**^) under 
vacuum and lyophilized. 
i) STEP 9 

Step 7 is repeated. 
2^ The final product. Component B, is recovered as an amorphous white 
powder . 
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Example 2 ■ Analytica l characterisation of component B 

In order to specify the main physical-chemical characteristics of 

Component B the purified material from urine underwent the following 

analytical controls. 

a) AMINO ACID SEQUENCE 

The amino acid sequence of Component B was determined according to 
the Edman method 

The analysis was performed using a sequencer Applied Biosystem. 

model «I77A. following the indications given by the producer. 

Such analysis gave the possibility of identifying the amino acid 

sequence of Component B relatively to the 8l amino acid residues 

reported in SEQ ID NO: 1. 

b) DETERMINATION OF MOLECULAR WEIGHT 

The analysis was performed by "Electron Spray - Mass Spectrometry" 
(ES-MS) and showed a molecular weight of 8937.9 Daltons. Such 
analysis has evidenced five disulfide bridges and an 80 Daltons 
residue attributable to an S0^ group bound to Tyr (39). 
Example Isolation o f Human Component B p^ennmir una 
A human genomic DNA library in lambda phage vector EMBL-3 SP6/T7 was 
purchased from Clontech (cat. No. HL 1067 J. Lot No. 1221). Genomic 
DNA was extracted from human placenta and partially digested with 
Sau 3A. DNA fragments were separated on a sucrose gradient to 
produce size range between 8 to 22 Kb before cloning into the BamHl 
site of EMBL-3 Sp6/T7 vector. 
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Culture media 

E. coli K802 cells, purchased fro. Clontech (cat. He. ClOO^-D. "ere 
cultured in LB «ediu. supple.ented with 10 «« mSO, and 0.2 i 

maltose (culture medium) . 

Phase library was diluted in 0.1 M NaCl. 8 H^O, . 50 - Tris-Cl 
pH 7-5. O.Dl% gelatin (SM) . ''f 

The DNA library was plated onto 1.5% agar-LB plates. Top agarose for 
library plating was: O.136 M NaCl. 0.6% agarose. 1% tryptone (Merck 
cat No. 7213) • 



Hybridization re agents 
20xSSC 

Hybridization solution 



25 



3 M NaCl. 0.3 M Na citrate. pH 7-0 
5xSSC. 0.02% SDS.0.1% N-lauroylsarcosine. 
0.5% Blocking reagent (Boehringer cat No. 
IO96I76) . 

washing solution A 3xSSC. O.U SDS. urea at various concen- 

(HRP-oligos) trations depending on the specific probe 

(see below) . 

washing solution B IxSSC. 0.1% SDS 

(32p.oiigo CBEXUL) 
Detection s.ystem 

HRP-oligo/DNA hybrids were detected by ECL kit and exposure to 

I- //*o^ Nn RPM 2106 and 2104, 
Hyperfilm ECL from Amersham (cat. No. 

respectively) . 

32p-oligoprobed filters were revealed by exposure to Hyperfil. p-.ax 

(AmershaiD cat. No. RPNIO) . 
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Oligonucleotides 

Oligonucleotides were synthesized by automatic DNA synthesizer 
{Applied Biosystem mod. 392}. 

Oligonucleotides were purified by OPC cartridges (Applied Biosystem 
5 cat. No. 400771) or by denaturing PAGE* 

Oligonucleotides CBl , CB2 and CBEX2L to be used as probes were 5* 
modified with N-MMT-C12-aminomodif ier (Clontech cat. No. 5206-1) 
during the last cycle of synthesis. 

Horse-radish peroxidase (HRP, Boehringer cat. No. 81^*393) was 
10 conjugated to the modified oligonucleotide according to M. S. Urdea 
(Nuc . Ac. Res . 16 . i*937 , 1988 ) , by using 1 . ^-phenylen- 
diisothiocyanate (Aldrich cat. No. 25.855-5) as homobifunctional 
crosslinking agent. 

HRP-oligop robes were purified by anion-exchange HPLC on a Nucleopac 
15 PA-100 column {Dionex cat. No. O^SOIO) . Elution was performed with 

20 mM Na phosphate buffer pH 6.0 and a linear gradient of NaCl from 
0.2 to 1 .0 M in 30 min. . 

Purified HRP-oligonucleotides were concentrated by Centricon 10. 
washed with PBS and stored at in the dark, 
20 The HRP-oligonucleotide concentration was calculated by OD (^Z|03 
=89.5 cm"^xmM"^) . 

The following oligonucleotides were synthesized: 
oligo # sequence target 

CBPUl 5'TGACTCACACGGCCGGTrCT promoter SEQ ID NO: 5 

CBPLl 5'CAGCCATGTCCAGTGGTCCT promoter SEQ ID NO: 6 

CBEX2L 5*ACCACAGCCCATGCTCCA exon 1 SEQ ID NO: 7 
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CBl 
CB2 
CB3 
CB5 



exon 3 
exon 3 
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5'TGCAGGAAGCACTGGTCAT exon 2 

5'TCTGGCTrGCAGCGGGTAATGGT exon 2 

5'ATGACCAGTGCTrCCTGC exon 2 

5 • ATCCCACCTGCTGCCTTTTG 
CBEX4U2 5 * CGGCGGCTGGGAGCAGT 
CBFl 5 • ATGGAATTCTAYCCATTYAAYCARTC 

CBF2 5 ' ATGGAATTCTAYCCATTYAAYCARAG 

CBRl 5 • GTAGAATTCGCGCCAATGGARTCNGGRTC exon 3 

CBR2 5 • CJTAGAATrCGCGCCAATGCTRTCNGGRTC exon 3 

CBEX4U 5 • AGTACCCCTTCAACCAGAG 
CBEXU L 5 • C AGACACC ATGAGTGAGCTG 
CBEX4U1 5 • GACACCTCCTCTGTGACGG 
CBEXU LI 5 ' CCAGTTCTGTAGGGTGTCAGT 
where R = A-G. Y = C-T and N = A-G-C-T. 
Library titration 

The human genomic DNA library was titred according to standard 
procedures (F. Ausubel. Current Protocols in Molecular Biology) by 
infecting 0.3 ml of an overnight culture of E. coli K802 cells with 
various dilutions of the library, in the range 2x10-3 to 2x10-7. 
Cell-library mixture was incubated at room temperature for 20 min. 
then transferred to 37°C for 10 min. Infected cells were mixed with 
^ ml of top agarose preheated at 50°C and poured onto a 10 cm agar 
plate prewarmed at 37 °C. Plates were incubated at 37 °C overnight 
(ON). 

The number of plaques was scored in each plate. Duplicate plates 



SEQ ID NO: 8 
SEQ ID NO: 9 
SEQ ID NO: 10 
intron 2 SEQ ID NO: 11 
intron 2 SEQ ID NO: 12 
SEQ ID NO: 13 
SEQ ID NO: iM 
SEQ ID NO: 15 
SEQ ID NO: 16 
SEQ ID NO: 17 
SEQ ID NO: 18 
SEQ ID NO: 19 
SEQ ID NO: 20 
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were prepared for each library dilution. The human genomic DNA 
library titer was found to be 5 x 10^ pfu/ml, as expected. 
Library serening 

E, coli K802 cells grown overnight at 37 '^C. 0.6 ml of cell culture 
were infected with a library aliquot (6 x 10^ pfu) suspended in SM. 
Infection and plating was 5perf ormed as above, but 9 ml of top 
agarose and 15 cm plates were used. 

Semiconfluent plaques were transferred onto Hybond N* nylon membrane 
(Amersham) , according to Amersham instruction manual. 
Blotted DNA was denaturated by placing the filters, plaque side up, 
onto a filter paper soaked in 1.5 M NaCl, 0.5 M NaOH for 7 min. 
Blotted DNA was then neutralized by placing the filters onto a 
filter paper soaked in neutralizing solution {I.5 M NaCl, O.5 M 
Tris-Cl pH 7.2, 1 mM EDTA) twice for 3 min each. 

Filters were washed in 2xSSC and air-dried. DNA was fixed to the 
15 membrane by placing the filters onto a filter paper soaked in OA M 
NaOH for 20 min. 

The filters were finally washed in 5xSSC for 1 min and stored in a 
plastic bag at up to hybridization. The human genomic DNA 



10 



library (1 x 10^ clones) was screened at high plating density with 
20 HRP-CB2 oligoprobe. 20 positive clones were selected. 

Six positive clones were rescreened with both HRP-CBl and HRP-CB2 
oligoprobes. Three clones, named 4D, 12B and I5, were confirmed to 
be positive for Component B gene. 
Hybridization 

25 Filters were preincubated at ^2''C for 30 min in the hybridization 
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solution, then hybridized with the appropriate HRP-oligoprobe (5 
ng/ml oligonucleotide moiety in hybridization solution) at U2°C for 
45 min and finally washed twice for 15 min each at 42°C in washing 
solution containing urea at the appropriate concentration (see 

3 below) . 

Flutters were washed briefly in 2xSSC at room temperature and the 
hybridized plaques were detected by ECL reagents and exposure to 
Hyperfilm for 60 min. 

Washing conditions for HRP-oligoprobed filters were experimentally 
10 determined to minimize unspecific hybridization to E. coli and 
lambda phage DNAs. Serial dilutions in the range 500 to 15 attomoles 
of target DNA were spotted on Hybond N* membrane in the presence of 
lambda DNA (10 ng). Lambda and E. coli DNAs (10 ng each) were used 
as negative controls. Several strips were prepared and used in 
15 hybridization experiments with 5 ng/ml probe. Washings were 
performed with washing solution A containing 0. 9. l8. 27 and 36% 
urea. 

m and 27% urea were found effective for CBl and CB2. respectively. 
Filters hybridized with CBEX2L were washed with washing solution A 

20 containing 18% urea. 

Hybridization with 32p_oiigo CBEX4L was performed at 50°C and 
filters were washed at 45 °C in washing solution B. 
Plaque subscreening 

Positive plaques were picked up by a Pasteur pipette and 
25 transferred to a tube containing 1 ml of SM plus a drop of 
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chloroform. After 2 hr incubation under shaking at room temperature, 
the phage suspension was stored at 4**C. 

A 10*^ dilution of the phage suspension was plated onto a 10 cm 
plate and rescreened on two replicate filters with two oligoprobes, 
5 i.e. the one used in the first screening and another one matching to 

an adjacent region of component B. 

Independent clones positive with both probes were picked up and 
resuspended as above. 
Preparation of phage stocks 

10 Positive clones were expanded by infecting E. coli K802 cells and 
growth on 15 cm agar plates. After ON incubation at the 
confluent lysate was collected from agar plates with 10 ml of SM. A 
few drops of chloroform were added, cell debris were removed by 
centrif ugation at 3000 rpm for 5 niin at ^**C and the clear 

15 supernatant containing the phages was brought to 50% glycerol, 
aliquoted and stored at -80* C. 
Extraction of phage DNA 

2x10^ E, coli K802 cells were infected with the selected phage clone 

(cell/phage ratio = ^:1) and grown in lOO ml liquid culture medium 
20 ON at 37**C. At the end of incubation, full cell lysis was 

accomplished by adding chloroform (5 pl/ml) to the culture. 

Phage DNA was extracted by Quiagen, according to the manufacturer 

instructions. 

Phage DNA sequencing 
25 Phage DNA was sequenced by cycle sequencing kit from Applied 

Biosystem (cat- No, 401388) with an automated DNA sequencer (Applied 
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Biosystem mod. 373A) • Phage DNA was extracted from clones UD. 12B 
and 15 and sequenced by cycle sequencing. 

Sequencing primers were derived either from the amino acid sequence 
of Component B (CBFl. CBF2. CBRl . CBR2) or from the available cDNA 
or genomic DNA sequencing data. 

Sequencing data indicated that the three clones contained the full - 
length Component B gene. 
Phage DNA restriction analysis 

Phage DNA was submitted to single and multiple restriction enzyme 
digestions. DNA fragments were resolved by 0.6% agarose gel 
electrophoresis and then blotted onto Hybond N* membrane. Filters 
were repeatedly probed with oligonucleotides CBEX2L. CB2 and CBEX4L. 
matching to exon 1. 2 and 3 respectively. 
Subcloning of Component B gene in p BlueScript II SK 

Restriction analysis of Component B clone 40 with EcoRl . Xho I and 
Sfi 1 and Southern blotting with oligoprobes specific for the three 
exons of Component B indicated that the entire Component B gene was 
comprised in a 5-2 Kb EcoRl fragment. (Fig. 5)- 

Phage DNA was estracted from clone 40 and digested with EcoRl. The 
resulting DNA fragments were resolved by agarose gel 
electrophoresis, the 5-2 Kb fragment was purified by Qiaex (qiagen 
cat. No. 20020) and ligated to EcoRl linearized pBlueSript II KS 
(Stratagene cat. No. 212207). E. coli strain XLl-Blue (Stratagene 
cat. No. 200268) was transformed with the litation mixture and 
transformed cells were selected on Ap/Tc plates. One clone 
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containing the expected plasmid, as shown by resctriction analysis 
with EcoRl, was isolated and named pBSCB^D, 

Further restriction analysis was performed with Smal, Kpnl , Hind 
III, Sfil, Accl, Notl, Sail, Xhol, EcoRV. Clal, Hinc II. Hind II, 
Seal, Bgl II, Aat 2, Ncol, Nhel, Hpal and Mlul . In addition Southern 

g blotting was performed with Component B specific oligoprobes on 
pBSCB^D after single and double digestions. 
Fig. 6 shows the restriction map of pBSCB4D plasmid. 
Figure 4 shows the restriction map of Component B gene . The 
Component B gene contains 3 exons separated by 2 introns. The exons 

10 are flanked by appropriate consensus acceptor and donor splice 
sites . 

Exon 1 is 84 bp in length and contains 26 nt of untraslated mRNA and 
the sequence coding for 19 amino acids of a putative signal peptide. 
It is separated from exon 2 by an intron of 4lO bp. 
IS Exon 2 is 120 bp in length and codes for 3 amino acids of a putative 
signal peptide and 37 amino acids of the mature protein. It is 
separated from exon 3 by an intron of about 550 bp. 

Exon 3 is 326 bp long; it encodes the C-terminal ^^ amino acids of 
Component B and 192 nt of untraslated mRNA. containing a 
20 polyadenylation signal (TATAAA) 14 bp upstream to the 3' processing 
site, to which end the poly (A) tail is attached. 

In particular, in the three genomic clones the signal peptide 
encoding sequence was found to contain a Leu codon at position 11 of 
the putative signal peptide. 
25 The amino acid sequence of Component B derived from the genomic gene 
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was found to be identical to the one experimentally determined by 
Edman degradation. 

Sequence analysis upstream to exon 1 evidentiated a promoter region 
(Figure 3) containing a TATA box at -28 and various upstream 

5 promoter elements and enhancers, including a GC-rich box at -58. an 
AP-1 site at -83. an AP-2 site at - 360 and several E boxes. 
The TATA box is the preferred binding site for the transcription 
initiation factor TFIID. The GC-rich box represents the binding site 
for Sp-1. a general transcription factor involved in the 

10 transcritpion of a wide variety of genes (Transcription and Splicing 
B.D. Hames & D.M. Glover Eds. IRL press. 1988). 

The AP-1 site is the binding site for AP-1. the transcription factor 
complex formed by c-fos and c-jun. The AP-1 site is present in 
several genes involved in cell growth and differentiation. AP-1 is 
the one of several cis-elements that mediate induction responses to 
activators of protein kinase C (The hormonal control of gene 
transcription P. Cohen & J.G. Foulkes Eds. Elsevier. 1991)- 
The AP-2 site is the target for AP-2. a transcription factor 
activated by PMA and cAMP {ibidem). 

E boxes are common sequences found in several enhancer regions and 
play an important role in determing the tissue-specific expression 
of genes. E box contains the sequence CANNTG. with the two internal 
bases changing according to the specific E box (R. E. Kingston 
Current Opinion Cell. Biol. 1989; 1. 108l - 1087). 

The Component B promoter contains a potential responsive element for 
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glucocorticoid receptor (GRE) , which indicates that the Component B 
gene could be induced by glucocorticoids. 

Subcloning oT Component B gene in a vector for expression in 
mammalian cells 

5 It is known that the expression of rec-proteins in mammalian cells 
may be improved by the presence of intron ( s ) . The Component B 
genomic DNA can be expressed in mammalian cells - 

To this end, a 1364 bp fragment spanning Component B gene from +50 
to +IM13 is excised from pBSCBUD by Pvu II and Nar 1 digestion. 

10 P'ig- 2 shows the restriction map of Component B transcriptional unit 
where Pvu II and Narl sites are based* The entire Component B gene 
is reconstituted by ligation of this fragment with a synthetic 
oligonucleotide reproducing the 5* end of the gene, flanked by a 
suitable restriction site for the subsequent gene cloning in an 

15 eukaryotic expression plasmid. 

Example ^: Isolation of Component B cDNA clones 

Rapid Amplification of cDNA Ends (RACE), a technique described by 
Frohman et al. . (I988) Proc. Natl. Acad. Sci. USA 85, 8998, was used 
to obtain partial cDNA clones corresponding to the 5* and 3* ends of 
20 the Component B mRNA. The partial clones contained overlapping DNA 
sequence and thus could be combined to construct the full-length 
Component B cDNA sequence. A diagram depicting the general strategy 
used for RACE cloning is shown in Figure ?. 

For 3' RACE, the DNA sequence of the second exon of the Component B 
25 gene was available and was used to design the gene specific primer 
CKCBl {5*-TCAAGTGCTACACCTGCAAGGAG-3* ) (SEQ ID NO: 21). cDNA 
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synthesis was primed from the poly A tail of human 
uterus poly A* RNA with the oligonucleotide 5' 
GGCCACGCGTCGACTAGTAC lTi - 1 1 i i 1 1 1 1 1 1 i 1 i i - 3 ' (SEQ ID NO: 2U). called the 
adapter primer of AP. The cDNA was used as the template for a 
polymerase chain reaction (PGR) with the CKCBl and AP prin>ers. which 
produced an approximately 450 base pair (bp) fragment corresponding 
to the 3' end of the Component B cDNA. 

For 5' RACE, the primer CKCB7 (5' -CGTCAGAGAGGAGGTG-3' ) (SEQ ID NO: 
22 was designed from the DNA sequence of the 450 bp 3' RACE fragment 
and was used to prime cDNA synthesis from human uterus poly A* RNA. 
After purification to remove the mRNA and the CKCB? primer, an 
oligodeoxycytidine tail was added to the 3' end of the cDNA. The 
tailed cDNA was used as the template in a PGR with the nested primer 
GKCB2 (5'-ACGGTGAGCAGGGTGGTG-3') (SEQ ID NO: 23) and the anchor 
15 primer (AGP. 5' -CTACTACTAGTAGGCCACGGGTCGACTAGTACGGGIIGGGIIGGGIIG-3' ) 
(SEQ ID NO: 25) or a mixture of the AGP and the universal 
amplification primer (UAP. 5'CTACTACTACTAGGCGAGGGGTCGACTAGTAC-3' ) 
(SEQ ID NO: 26). CKGB2 annealed to the 3' end of the second exon 
sequence and the AGP annealed to the oligodeoxycytidine tail. An 
approximately 23O bp fragment was obtained which contained DNA 
sequence corresponding to the 5' end of the Component B mRNA. 
General experimental protocols (such as polyacrylamide gel 
electrophoresis, ethanol precipitation, ligation, and restriction 
endonuclease digestion), bacterial culture media (such as LB) and 
chemical solutions (such as phenol) used are described in detail in 
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Sambrook et al , (1989) Molecular Cloning: A Laboratory Manual, 2nd 
ed.. Cold Spring Harbor Laboratory Press. New York, unless otherwise 
referenced. 

3* RACE cloning procedure 

The 3' RACE System for Rapid Amplification of cDNA ends was 
purchased from Life Technologies » Inc., Grand Island, NY. Human 
uterus poly A^ RNA was purchased from Clontech Laboratories, Inc. 
Palo Alto, CA. First strand cDNA synthesis was done using the 
protocol and reagents supplied with the 3* RACE system. Briefly, 1 
pi (1 pg) of uterus poly A^ RNA was combined with 1 pi of a 10 pM 
solution of AP and 12 pi of diethyl pyrocarbonate ( DEPC) -treated 
water and the mixture was heated to 65 'C for 10 minutes. After 
chilling the mixture on ice, the reaction components were added so 
that the final composition was approximately 20 mM Tris-HCl (pH 
8.4), 50 mM KCl, 2.5 mM MgCl2. 100 pg/ml bovine serum albumin, 500 
nM AP, 500 pM each of dATP, dCTP. dGTP, and dTTP. and 50 ng/pl RNA 
in a volume of 19 pl • The reaction mixture was heated to ^2**C and 1 
pi (20 units) of Superscript reverse transcriptase was added. After 
incubation for 30 minutes at 42**C, the reaction mixture was chilled 
on ice and 1 pl RNaseK (2 units) was added. RNaseH digestion was 
done for 10 minutes at a temperaure of ^2''C, The reaction mixturre 
was stored at -20 "C prior to the PCR. 

For PCR, four identical 40 pl mixtures were prepared each with the 
following composition: 1 pl of uterus poly A* cDNA in ^0 mM KCl. 70 
25 mM Tris-HCl (pH 8.8). 0.1% Triton X-100, 1 mM MgCl2, 0,25 pM CKCBl, 
and 0.5 pM AP. Reagents from the 3* RACE system were not used for 
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PGR. Both the CKCBl and AP primers were synthesized on an Applied 
Biosystems. Inc. model 392 oligonucleotide synthesizer. After 
deprotection. lyophilization. and resuspension in DEPC-treated water 
the optical density of each solution was measured at a wavelength of 
260 nm. Based on the optical density measurements, a 10 pM solution 
of each oligonucleotide was prepared in DEPC-treated water. The 
crude oligonucleotide solutions were used for PGR with no further 
purification. The concentration of crude AP that produced identical 
results to the AP supplied with the 3 "RACE system was experimentally 
determined: 0.^ uM crude AP was equivalent to 0.2 uM AP from Life 
Technologies, Inc. in the PGR. 

The ^40 pi PGR reactions were heated to 9^°C in a temperature cycler 
before adding to each a 10 \il mixture containing the following: 1.25 
units AmpliTaq DNA polymerase (Perkin Elmer Getus. Norwalk. GT) . 40 
mM KGL. 70 mM Tris-HGl (pH 8.8). 0.1% Triton-X-lOO. 1 mM MgGl2, and 
1 mM each of dATP. dlTP, dCTP. and dGTP. The final concentration of 
each reagent in the PGR was approximately 1 ul uterus cDNA per 50 
pi. 1.25 units AmpliTaq DNA polymerase per 50 pi, 40 mM KGl . 70 mM 
Tris-HCl (pH 8.8). 0.1% Triton X-100. 1 mM MgGl2 . 0.2 pM GKGBl. 0.4 
pM AP. and 0.2 mM each of dATP. dTTP . dGTP. and dGTP. After 
completing a 5 minute incubation at 94 °C. a "Touchdown" PGR 
temperature cycling program was performed according to Don. R.H., 
Cox P.T.. Wainwright. B.J., Baker. K. and Mattick. J.S. (1991) Nucl. 
Acids Res. 19, 4008, by varying the annealing temperature from 73°C 
25 to 63°C. 
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After PGR amplification, the four reactions were combined and the 
DNA products were size-fractionated by electrophoresis on a 5% 
poiyacrylamide gel. A DNA product of aproximately ^50 bp was excised 
from the gel and purified by electroelution in dialysis tubing. The 
5 eluate was extracted with a 50:50 (v:v) mixture of phenol and 

chloroform, ethanol precipitated, dried, and resuspended in 10 yl 
sterile water. 

Due to the template independent terminal transferase activity of Taq 
DNA polymerase and its strong preference for dATP (Clark, J.M, 
10 (1988) Nucl. Acids Res. l6, 9677 and Mole, S.E., Iggo, R.D. and Lane 
D.P. (1989) Nucl. Acids Res. 17. 3319 ). the purified ^50 bp FCR 
fragment was expected to have a single deoxyadenosine residue at 
each 3* end. For subcloning and characterizing the PGR fragment, a 
pBluescriptSK+ (Stratagene, La Jolla, CA) "T-vector" was prepared 
15 essentially as described by Marchuk et al. (1991) Nucl. Acids Res. 

19. 1154 . The pBluescript plasmid (20 pg) was digested with EcoRV 
restriction endonuclease and then purified by extraction with a 
50:50 (v:v) mixture of phenol and chloroform. After ethanol 
precipitation, the DNA was treated with 9 units of Taq DNA 
20 polymerase for 2 hours at 70"Cin a 50 yl reaction containing 50 mM 
KCl, 10 mM Tris-HCl (pH 9.0), 0.1% Triton X-100, I.5 mM MgCl2, and 2 
mM dTTP. The vector was again purified by extraction with phenol and 
chloroform (50:50 v:v) and ethanol precipitation. This procedure 
resulted in the addition of a single deoxy thymidine residue to each 
25 3' terminus, and rendered the vector compatible for insertion of DNA 
fragments synthetized with Taq DNA polymerase. 
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The unphosphorylated 450 bp PCR fragment was inserted into the T- 
vector in a reaction with TU DNA ligase (New England Biolabs. 
Beverly. MA) using the conditions recommended by the manufacturer. 
The ligation reaction was incubated for approximately 72 hours at 
16 °C and then used to transform competent E. coli XLl-Blue cells 
(Stratagene. La Jolla. CA) . The transformed cells were plated onto 
LB agar containing 50 pg/ml ampicillin. Prior to plating the cells. 
100 ul 2% X-gal (Life Technologies. Inc.. Grand Island. NY) and 40 
pi 100 mM IPTG (Life Technologies. Inc.. Grand Island, NY) were 
10 sequentially spread onto the agar surface of each plate and allowed 
to dry. After an overnight incubation at 37°C. colonies with blue 
pigment and unpigmented colonies (white) were visible. Plasmid DNA 
was purified from cultures of 12 white colonies. All 12 isolates 
contained the 450 bp insert. Five clones were chosen for further 
analysis: 3CB4 . 3CB6. 3CB7 . 3CB8 and 3CB9. DNA sequence analysis 
was done using a Sequenase version 2.0 kit (United States 
Biochemical. Cleveland. Ohio). 
5' RACE cloning procedure 

The 5* RACE system for Rapid Amplification of cDNA ends was 
purchased from Life Technologies. Inc.. Grand Island. NY. Human 
uterus poly A* RNA was purchased from Clontech Laboratories. Palo 
alto. CA. The 5' RACE cloning experiments were done using the 
protocol and reagents supplied with the 5' RACE system with the 
following exceptions: (a) the CKCB7 . ACP and UAP primers were 
synthetized on an Applied Biosystem. Inc. model 392 oligonucleotide 
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synthetizer and prepared as described for 3' RACE cloning, and (b) 
the "Touchdown" PGR temperature cycling program described for 3* 
RACE cloning was used for cDNA amplification. First strand cDNA was 
synthetized as follows: 1 pi (1 pg) of uterus poly A"*" RNA was 

5 comnbined with 0,5 pl of a 10 pM solution of CKCB? and 13,5 pl of 
DEPC-treated water and the mixture was heated at 70''C for 10 
minutes. After chilling the mixture on ice, the reaction components 
were added so that the final composition was approximately 20 nM 
Tris-HCl (pH 8,^). 50 mM KCl, 3 mM MgCl2, 10 mM DTT, 200 mM CKCB?. 

10 400pM each of dATP, dCTP. dGTP, and dTTP, and ^0 ng/pl RNA in a 
volume of 2^ pl. The reaction mixture was heated to ^2**C and 1 pl 
(220 units) of Superscript II reverse transcriptase was added. After 
incubation for 30 minutes at 42'c and for 15 minutes at 70"C, the 
reaction mixture was placed at 55 and 1 pl RNaseH (2 units) was 

15 added, RNaseH digestion was done for 10 minutes at a temperature of 
55'c. 

The cDNA was separated from unincorporated dNTPs , CKCB7 , and 
proteins unsing a Glassmax DNA Isolation Spin Cartridge (included in 
the 5' RACE system). Specifically, 120 pl of binding solution {6M 

20 Nal) was added to the first strad reaction, and the CDNA/Nal 
solution was transferred to a GLASSMAX spin cartridge. Following 
centrifugation at 13,000 X g for 20 seconds, 0.4 ml of cold (4*C) IX 
wash buffer was added. The spin cartridge was centrifuged at 13,000 
X g for another 20 seconds. This step was repeated two additional 

25 times. After washing twice with 400 pl of cold (4*C) 70% ethanol, 
the cDNA was eluted by adding 50 pl of sterilized, distilled water 
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to the spin cartridge and centrifuging at 13-000 X g for 20 seconds. 
A homopolymer tail was added to the 3' end of the cDNA using 
terminal deoxynucleotidyl transferase (TdT) and dCTP. The tailing 
reaction was performed in a PCR compatible buffer. 10 yl of 
purified CDNA was combined with 7-5 ul of DEPC-treated water. 2.5 pl 
of lOX reaction buffer. 1.5 pl of a 25 mM solution of MgCl 2 and 
2.5 ul of a 2mM solution of dCTP. The reaction mixutre was incubated 
for 2 to 3 minutes at 9'»°C. After chilling for 1 minute on ice. 1 pi 
f TdT (10 units/uD was added. The final composition was 
therefore: 10 pi cDNA in 20 mM Tris-HCl (pH 8.M . 50 mM KCl. 1.5 mM 
MgCl2. 200 uM dCTP. 0.4 units/pl TdT. The reaction mixture was 
incubated for 10 minutes at 37°C and then for 10 minutes at 70°C to 
inactivate the TdT. 

For the PCR. four different reactions were prepared with the 
following final primer concentrations {per 50 pi): 

1. 400 nM ACP 

2. 800 nM ACP 

3. 360 nM UAP and 40 nM ACP (UAPtACP. 9:1) 
^. 720 nM UAP and 80 nM ACP (UAP:ACP. 9:1) 

The final concentrations (per 50 pD of the remaining components in 
all four reactions were identical: 5 pl of uterus poly A* dC-tailed 
CDNA in 20 mM Tris-HCl (pH 8.4). 50 mM KCl, 1.5 mM MgCl2. 400 nM 
CKCB2. and 200 pM each of dATP. dCTP. dGTP and dTTP. The components, 
including the ACP and UAP primers, were mixed in an initial volume 
of 45 pl and heated to 94°C in a temperature cycler. A 5 pl mixture 
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of 1,25 units of AmpliTaq DNA polymerase (Perkin Elmer Cetus, 
Norwalk, CT) in 20 mM Tris-HCl (pH 8.4) was then added to each 
reaction to bring the final volume to 50 pi. The "Touchdown" PGR 
temperature cycling program (as described for 3* RACE cloning) was 

5 used to amplify 5' cDNA fragment. 

After PCR amplification, the four reactions were combined and the 
DNA products were size- fractionated by electrophoresis on an 8% 
polyacrylamide gel, A DNA product of approximately 230 bp was 
excised from the gel, purified, and subcloned into the "T-vector" as 

10 described for the 450 bp 31 RACE fragment. Five clones were chosen 
for further analysis: 5CB2, 5CB3, 5CB5, 5CB6, 5CB11. DNA sequence 
analysis was done using a Sequenase version 2.0 kit (United States 
Biochemical, Cleveland, Ohio). 

Figure 8 reports the complete Component B cDNA sequence assembled 
15 from RACE clones 5CB3 and 3CB7» in which the restriction sites are 
indicated . 

By sequence alignment the cDNA sequence of clones 5cb3 . 5cb6, 5cbll 
and 3cb4 , 3cb7, 3cb9 were shown to perfectly match with the 
Component B exons (genomic clone 4D of Example 3) • 

20 Although the present invention was illustrated with specific 
examples, it is clear that modifications can be performed on the 
operations as described always remaining within the spirit and scope 
of the invention. 
LEGENDS TO FIGURES 

25 Figure 1. Flow chart of the process of Component B from urine. 

Figure 2. Restriction map of Component B genomic transcriptional 



BNSDOCIO: <WO 94 1 49S9A1 _l_> 



wo 94/14959 



PCT/EP93/03645 



- 36 - 

unit (such Component B genomic DNA reported in SEQ ID NO: 2). 
Arrows indicate the splicing sites. 

Figure 3. Sequence of Component B promoter region {said Component B 
promoter region reported in SEQ ID NO; 2). Binding sites for AP-1, 
5 AP-2. Sp-1 and E-boxes transcription factors are indicated. TATA box 
is indicated. GRE is also indicated. 

Figure 4. Restriction map of Component B gene. The derived mRNA is 
shown below the genomic gene by a line, the boxed region represents 
the protein encoding sequence. 
10 Figure 5. Restriction map of clone ^D insert. 
Figure 6. Restriction map of pBSCB^D plasmid. 

Figure 7, General strategy used for RACE cloning of Component B DNA 
sequence. 

Figure 8, Complete Component B cDNA sequence, in which the 
15 restriction sites are indicated (said Component B cDNA reported in 
SEQ ID NO: 3) . 
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CLAIMS 

1 1. Polypeptide comprising the peptide sequence of sequence SEQ ID 

2 NO: 1, its salts, functional derivatives, precursors and active 

3 fractions or mixtures thereof. 

1 2. Process for the production of the polypeptide according to claim 

2 1 comprising the following steps: 

3 a) adsorption of urine, at: pH acid, on kaolin and extraction with 

4 ammonia 

5 b) elution of fraction (a) on Bio Rex 70 resin with ammonia 

6 c) elution of fraction (b) on DEAE Sepharose resin with acetate 

7 buffer 

8 d) elution of fraction (c) on CM Sepharose resin with acetate buffer 

9 e)elution of fraction (d) on HPLC Cl8 resin with a mixture of 

10 acetate buffer and acetonitrile 

11 f) elution of fraction (e) on DE-52 resin with acetate buffer 

12 g) elution of fraction (f) on D-Zephyr resin with acetate buffer 

13 h) elution of fraction (g) on HPLC Cl8 resin with a mixture of 

14 aqueous trifluoroacetic acid and acetonitrile 

15 i) elution of fraction (h) on D-Zephyr resin with acetate buffer. 
1 3. Process according to claim 2 wherein the urine is human urine. 

i». DNA molecule comprising the DNA sequence coding for the 
polypeptide according to claim 1. its mutant or active fraction. 
5. DNA molecule which hybridizes with the DNA molecule according to 
claim H and codes for a polypeptide according to claim 1, its mutant 
3 or active fraction. 



1 

2 
1 

2 
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1 6. Genomic DNA molecule comprising the nucleotide sequence of SEQ ID 

2 NO: 2. 

1 7. cDNA molecule comprising the nucleotide sequence of SEQ ID NO: 

2 3. 

1 8. Expression vector comprising the DNA molecule according to any of 

2 claims 4 to 7- 

1 9. Host cell transformed with an expression vector according to 

2 claim 8. 

1 10. Process for the production of the polypeptide according to claim 

2 1 comprising the culture of a host cell transformed according to 

3 claim 9 and the recovering of the protein from said cell or from the 

4 culture medium. 

1 11. Essentially pure protein, its salts, functional derivatives. 

2 prodrug and active fractions, obtainable through a process 

3 comprising the following steps: 

o • « =.1- acid on kaolin and extraction with 

4 a) adsorption of urine, at pH acia. on k. 

5 ammonia 

6 b)elution of fraction (a) on Bio Rex 70 resin with ammonia 

7 c)elution of fraction (b) on DEAE Sepharose resin with acetate 

8 buffer 

9 d) elution of fraction (c) on CM Sepharose resin with acetate buffer 
,0 e)elution of fraction (d) on HPLC Cl8 resin with a mixture of 

11 acetate buffer and acetonitrile 

12 f) elution of fraction (e) on DE-52 resin with acetate buffer 

13 g) elution of fraction (f) on D-Zephyr resin with acetate buffer 

1^ h) elution of fraction (g) on HPLC Cl8 resin with a mixture of 
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15 aqueous trif luoroacetic acid and acetonitrile 

16 i) elution of fraction (h) on D-Zephyr resin with acetate buffer. 
I 12. Polypeptide according to claim 1 for use as a medicament. 

1 13. Use of the polypeptide according to claim 1, for the preparation 

2 of a medicament having anti-inflammatory and/or anti-coagulant 

3 and/or anti-tumoral activity; 

1 14. Pharmaceutical composition comprising a therapeutically 

2 acceptable quantity of a polypeptide according to claim 1. or its 

3 salt, functional derivative, precursor, active fraction or mixture 
^ thereof in combination with one or more therapeutically acceptable 
^ eccipient or eluent. 

^ 15. Protein according to claim 11 for use as medicament. 

^ 16. Pharmaceutical compositions comprising a therapeutically 

2 acceptable quantity of a protein according to claim 11. or its 
salts, functional derivatives, precursors, active fractions or 

^ mixture thereof in combination with one or more pharmaceutically 

5 acceptable eccipients or eluents. 
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MALE CONCENTRATED URINE 
i BIO -REX 70 '; 

1 

i DEAE - SEPHAROSE _F.F._ J 

. . J. 

r CM - SEPHAROSE F.F. 

: C18 RP-HPLC i 
STEP 6 \ DE-52 i 
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D - ZEPHYR 
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C18 RP-HPLC 
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STEP 
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D -ZEPHYR 



STEP 1 

STEP 2 
STEP 3 
STEP 4 
STEP 5 



Fig.l 

Flow chart of the purification of Component B 
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FIGURE 2 : Restriction map of Component B genomic 
transcriptional unit 



ATG, GCC TCT CGC TGG GCT 595 
Met Ala Ser Arg Trp Ala 
-22 -20 

P 

u 

2 

GTG CAG CTG CTG CTC GTG GCA GCC TGG AGC ATG GGC TGT G GTGAGTGGGC 645 

Val Gin Leu Leu Leu Val Ala Ala Trp Ser Met Gly Cys 

-15 -10 -5 



S 
T 

1 

CGCAGGCTGG TGGGGACCTT GCCTCTGAGC TTGTCTGCCC ACCTCCTAGG GGGATGGGGC 705 



M S 
ST 
T U 
2 1 

TGTTGGGGGT GCTTTGTGGC TGAGAGCCTC CTTAGGCCTC CATGAGGCTC ACCCTCCTCA 765 



B 
A 
N 
2 

TTCTCAGTGA GCCTCCTGGG TCCCAGAGCC CAGCTTCACC CTGGGACAGG GGTCACGGCT 82 5 



P E 
S S 
T p 

i i 

CCACTCTGCA GGAAGGGAGA CTGAGGCTTG GTGGAGGGAT GCAGCATTCA AGTCTGTGGC 885 
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A S 

V M 

A A 

1 1 



TCAGCTCAGT TAGAGAAAGC TGCCAGAGAG GCCCCTTGAA GGSCTGCCCG GGGCCTTGAA 
AGATGTCAGC GAGACTCCTT CAGCCCCTGC CTCCTGGTTC CAGGATGAGV CCACCGAGGT 

CAGGTGAXGA GGTTCTGCCC CCATCCCTCA CCCAG ^^-^^^ -^^/?^3 

-3 1 



B P 

A S 

N T 

2 1 

TAG ACC TGC AAG GAG CCC ATG ACC AGT GCT TCC TGC AGG ACC ATT ACC 
Tyr ?hr cys Lys Glu Pro Met Thr Ser Ala Ser Cys Arg Thr He Thr 

5 . 10 15 

CGC TGC AAG CCA GAG GAC ACA GCC TGC ATG ACC ACG CTG GTG ACG GTG 
Arg cys Lys Pro Glu Asp Thr Ala Cys Met Thr Thr Leu Val Thr Val 

GAG GCA G GTGAGGCCAG GCCCCACGGC AGCCCTGGGT GCAGTGGAGT CAGGGCCACC 
Glu Ala 

TCCCCCAAGT GCGTCCCTCC TTTGCTGGTG CTCCTCCCGG CCCAAAAGGA AGCAGGTGGG 



S 
T 

y 

1 

ATGGGCAGAA CAGGCTGCCA CACCTTGGCA GGGGTGCCTT CCACGAGGGT GGCACAGCCC 
CCTCAGAGAC CCAGTCCTGG GGCACCAGGC GCTGGAGGTG GGTGGGGCTT AATGGCCGGG 

K B 
? A 
N N 
1 2 

GTACCCTGGG GGGCTCAAAC CCCAGCTCTG ACACAGACCC ACTGGGTGGT GTTGCCACAG 



945 
1005 
1057 



1105 



1153 



1210 



1270 



1330 
1390 



1-450 



wo 94/14959 



PCT/EP93/03645 



4/1; 



AB 
VA 
AN 
12 



B 
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N 
2 



U 
1 



CCTCTGGGCT CGGGCTCCCA TCTCAGCGCA GGCACTTCAG AGGTCTGACA AGGCCTAATA 1509 

3. 3 
? A 
N 
2 

ATTCATGAAC AGGTCACAGT CAGAGGAGGG CTGGGCCCTG GGTGGCTTCA CAGATGTGGA 1570 
CTATTGGGAA CAGGGATCAC AGGGAGGKTG AGGTCAGSCG ACGGCGGCTG GGAGCAGTGC 1630 



P 
S 
T 
1 



P 
S 
T 
1 



AGCAGCAGGC AGGCGCTGCA GGGGAGTGAG GGTTCTGACA CTGGCCCACC CTGCAG AG 

Glu 



1688 



8 
A 
N 



B 

S 
T 
E 



TAG CCC TTC AAC CAG AGC CCC GTG GTG ACC CGC TCC TGC TCC AGC TCC 
Tyr Pro Phe Asn Gin Ser Pro Val Val Thr Arg Ser Cys Ser Ser Ser 
40 45 50 



1736 



B 
A 

L 
1 



TGT GTG GCC ACC GAC CCC GAC AGC ATC GGG GCC GCC CAC CTG ATC TTC 
Cys Val Ala Thr Asp Pro Asp Ser lie Gly Ala Ala* His Leu lie Phe 
55 60 65 70 



1784 
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TGC TGC TTC CGA GAC CTC TGC AAC TCG GAA CTC TGAACCCAGG GCGGCAGGGC 1837 
Cvs CVS Phe Arg Asp Leu Cys Asn Ser Glu Leu 
75 80 



M 
S 
T 
2 



GGAAGGTGCT CCTCAGGCAC CTCCTCTCTG ACGGGGCCTG GCTCCACCTG TGATCACCTC 1897 
CCCCTGCTTC CTGCTGCTGT GGCACAGCTC ACTCATGGGG TCTGAGGGGA GAGAAGCACA 1957 



N 
A 
R 

1 



CCAGGGGCGC CCTCTGCCTT CCATACCCCA CGCTTATAAA ACATAACTAA GCCAAGCGTG 
GACATGACTT TTGT 



2017 
2031 
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FIGURE 3 : Sequence of Component B promoter region 



E box E box E box 



TGGCCCATGC TACCCTCACC TGACACCTGC TTCCTACCTC TGGTTTCTAC TTTGCAGGTG 60 



E box 



TGTATCAGGT GTACACAGAC CAGGTAGAGG TCTGTGGAGA GGGCTGCAGG CCAGGCTGCA 120 
GGGAAGGGGT GCCAGGCGGG GCTAGAGCAA CAAGGGCAGA GGCTACACTG AACCTGGGTC 180 

AP-2 



YTAAGGGTCC CCCAGGCTGG GGCTGGGTGG CCTATGTGAA CCCCAGAGGC ACAGCCAGGA 240 
CATGGGGGCT CATCAGAGGG GCAGTCTGAG CTCAGCAGGA AAGGCCTTCT CTGTCAGAGC 300 

GRE 



TGTCCCAGGA CCACTGGACA TGGCTGAGGA ACAGTGAGTT CCCCAGTGTT GGAGGTGTGC 360 
AAGCAGAGGC CTGGCCATCG TCCTCAGACA CAGCTCCCAG ATCCAGCTCC CTGCCCGTCT 420 

E box AP-1 



GCCATGTTCC TGCCAGCTGC CTCCCCACTG GGCCCTTTAC CACGTTCCTG ACTCACACGG 480 
Sp-1 TATA box 



CCGGTTCTGC CACCGCCCAG AAGCCGGTGC CCAAGGGCCT GGCTATAAAT CCTTGATGTG 540 
AGGCTGGCTA CCTCTCATCA CTTCTGAGCA CGGAGCA 
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Pig- 6 Restriction map of pBSCB4D 
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P,c;UR£ 8 : The complete Component B cDNA sequence assembled from 
Race clones 5CB3 and 3CB7 
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CTC GtG GCA GCC TGG AGC ATG GGC TGT GGT GAG GCC CTC AAG TGC TAG 
III Hi IT. Ala Xrp ser Me. Gly Cys Gly Glu Ala Leu Lys Cys Tyr 
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TGC AAG CCA GAG GAC ACA GCC TGC ATG ACC ACG CTG GTG ACG GTG GAG 
Cy^ pro G^u ASP Thr Ala Cys Met Thr Thr Leu Val Thr Val Glu 



51 



99 



ACC TGC AAG GAG CCC ATG ACC AGT GCT TCC TGC AGG ACC ATT ACC CGC 147 
rll cys Glu pro Met Thr Ser Ala Ser Cys Arg Thr He Thr Arg 

10 i^ 
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GCGGCAGGGC GGAAGGTGCT CCTCAGGCAC CTCCTCTCTG ACGGGGCCTG GCTCCACCTC 400 

B 

c 
1 
I 

TGATCACCTC CCCCTGCTTC CTGCTGCTGT GGCACAGCTC ACTCATGGGG TCTGAGGGGA 460 
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B H 
BsN a 
aaa e 
nHr I 
III I 

GAGAAGCACA CCAGGGGCGC CCTCTGCCTT CCATACCCCA CGCTTATAAA ACATAACTAA 



GCCAAAAAAA AAAAAAAAAA 



540 
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